Genetic code, hamming distance and stochastic matrices.

نویسندگان

  • Matthew X He
  • Sergei V Petoukhov
  • Paolo E Ricci
چکیده

In this paper we use the Gray code representation of the genetic code C=00, U=10, G=11 and A=01 (C pairs with G, A pairs with U) to generate a sequence of genetic code-based matrices. In connection with these code-based matrices, we use the Hamming distance to generate a sequence of numerical matrices. We then further investigate the properties of the numerical matrices and show that they are doubly stochastic and symmetric. We determine the frequency distributions of the Hamming distances, building blocks of the matrices, decomposition and iterations of matrices. We present an explicit decomposition formula for the genetic code-based matrix in terms of permutation matrices, which provides a hypercube representation of the genetic code. It is also observed that there is a Hamiltonian cycle in a genetic code-based hypercube.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Reed-Muller Codec Simulation Performance

The approach to error correction coding taken by modern digital communication systems started in the late 1940’s with the ground breaking work of Shannon, Hamming and Golay. ReedMuller (RM) codes were an important step beyond the Hamming and Golay codes because they allowed more flexibility in the size of the code word and the number of correctable errors per code word. Whereas the Hamming and ...

متن کامل

The Graph, Geometry and Symmetries of the Genetic Code with Hamming Metric

The similarity patterns of the genetic code result from similar codons encoding similar messages. We develop a new mathematical model to analyze these patterns. The physicochemical characteristics of amino acids objectively quantify their differences and similarities; the Hamming metric does the same for the 64 codons of the codon set. (Hamming distances equal the number of different codon posi...

متن کامل

Point Representation for Local Optimization Towards Multi-Dimensional Gray Codes

In the context of stochastic search, once regions of high performance are found, having the property that small changes in the candidate solution correspond to searching nearby neighborhoods provides the ability to perform effective local optimization. To achieve this, Gray Codes are often employed for encoding ordinal points or discretized real numbers. In this paper, we present a method to la...

متن کامل

The Genetic Code Boolean Lattice

The algebraic structures of the genetic code are most important to obtain additional information about the semantic code and its applications. In this paper we define two dual Boolean codon lattices of the genetic code using hydrogen bond numbers and the chemical types of bases: purines and pyrimidines. The Boolean lattices reflect the role of hydrophobicity in the distribution of codon assignm...

متن کامل

Studying genetic code by a matrix approach.

Following Petoukhov and his collaborators, we use two length n zero-one sequences, alpha and beta, to represent a length n genetic sequence (alpha/beta) so that the columns of (alpha/beta) have the following correspondence with the nucleotides: C ~ (0/0), U ~ (1/0), G ~ (1/1), A ~ (0/1). Using the Gray code ordering to arrange alpha and beta, we build a 2(n) x 2(n) matrix C(n) including all the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Bulletin of mathematical biology

دوره 66 5  شماره 

صفحات  -

تاریخ انتشار 2004